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^^ . We introduce quantum versions of the x^-divergence, provide a detailed analysis of their properties, and 

^ ' apply them in the investigation of mixing times of quantum Markov processes. An approach similar to the one 

^ I presented in Ql-fj] for classical Markov chains is taken to bound the trace-distance from the steady state of a 

^~~i . quantum processes. A strict spectral bound to the convergence rate can be given for time-discrete as well as for 

("—^ ' time-continuous quantum Markov processes. Furthermore the contractive behavior of the x'^ -divergence under 

r\l I the action of a completely positive map is investigated and contrasted to the contraction of the trace norm. In this 

, context we analyse different versions of quantum detailed balance and, finally, give a geometric conductance 

I I ' bound to the convergence rate for unital quantum Markov processes. 

r^ ■ 

ti ■ I. INTRODUCTION 

c ■ 
a ■ 

^ I The mixing time of a classical Markov chain is the time it takes for the chain to be close 

O^. to its steady state distribution, starting from an arbitrary initial state. The ability to bound 

the mixing time is important, for example in the field of computer science, where the bound 

p^ . can be used to give an estimate for the running time of some probabilistic algorithm such 

^ ' as the Monte Carlo algorithm. The mixing time for a classical Markov process Pij, with 

OO I J2i Pij = 1 on the space of probability measures S is commonly defined in terms of the 

'^ ■ one norm, ||p|| i — ^ ■ \pi\. Let n denote the fixed point of the classical Markov process, i.e. 

^\j I Pn ~ TT, then the mixing time is defined as: 

in ■ t„,„(e)==min{n|V(7e5, ||P"(7-^||i <e}. (1) 

o ; 

^D , A large set of tools has emerged over the years that allows to investigate the convergence rate 

'^ ' of classical Markov chains ^. One of the most prominent approaches HHsl] to bounding the 

^ , mixing time of a Markov chain is based on the x'^ -divergence. This divergence is defined for 

two probability distributions p,q E S as. 



X P,9)=>^ ■ (2) 

i * 

The usefulness of the x^-divergence for finding bounds to the mixing time of classical 
Markov chains arises from the fact that it serves as an upper bound to the one norm difference 
between two probabiUty distributions, 

\\p-q\\l<x^ip,q) (3) 

and allows for an easier access to the spectral properties of the Markov chain. The x^- 
divergence is intimately related to the Kullback-Leibler divergence, or relative entropy, 
H{p, q) — ^ • Pi (log Pi — log qi). In fact, it can be obtained directly from the relative entropy 
as the approximating quadratic form, i.e. as the Hessian, of the latter: 

X^ijP.q) = --^-7^H{q + a{p ~ q),q + I3{p - q)) \o,=f}=Q. (4) 

OaOp 



The x^ divergence was first introduced by Karl Pearson in the context of statistical inference 
tests, the most widely used of which is the "Pearson's x^ test". Its computational simplic- 
ity and its clear relation to other distance measures have made it one of the most studied 
divergence measures in the literature. 

In this paper, we find convergence bounds for arbitrary quantum Markov chains, also 
called quantum channels, with a technique that can be seen as a generalization of the work 
of lHI-lSl] to non-commutative probability spaces. A prototypical example of mixing time in 
physics is the decoherence time of the underlying quantum process, i.e. the time in which 
quantum states decohere to an (often classical) mixture given a specific underlying noise 
model. The ability to bound the mixing time for quantum processes also turns out to be 
relevant when one seeks to give bounds on the runtime of quantum algorithms that are based 
on quantum Markov chains |5, 6]. Other applications of such bounds can be found in the 
framework of matrix product states (Qlalj where the correlation length of the quantum state 
is connected to the convergence of the corresponding transfer operator that can be interpreted 
as a quantum channel. In this article, we introduce the mathematical framework necessary to 
extend the classical mixing time results to the quantum setting. In particular, we introduce 
a new divergence measure - the quantum x^ -divergence - for quantum states and use it to 
obtain some basic convergence bounds that mirror existing classical ones. Furthermore, 
we extend the classical concept of detailed balance to the quantum setting and discuss its 
relevance in general terms. 

The paper is organized as follows; The remainder of sectionUis devoted to setting the no- 
tation and to recalling the framework of quantum channels. Then in section|lIl we introduce 
the quantum x^ -divergence, and prove some basic properties relating it to other divergence 
and distance measures. In particular we focus on a specific subfamily of interest. In section 
Hm we consider contraction of the x^ -divergence under the action of a channel, and relate it 
to trace-norm contraction. Furthermore, we prove some fundamental quantum mixing time 
results, whose classical analogues are well known. In section IIVI we study quantum de- 
tailed balance, and in section |V] we extend an important classical geometric mixing bound 
(Cheeger's inequality) to the quantum setting. Conclusions are drawn in I VII 



A. Formal setting and notation 

Throughout this paper we will consider linear maps from the complex d-dimensional 
matrix algebra A^c( to itself. The states are density matrices p £ Sd, where Sd = 
{p € Md\p = p\p > 0,tr[/3] = 1 }, acting on io = (C^. The set of pure states is denoted 5^, 
while the set of positive definite states is denoted 5j". Note, that M.d itself becomes a Hilbert 
space when equipped with the standard Hilbert-Schmidt scalar product {A\B) = tr[^^B]; 

,2 

this Hilbert space is naturally isomorphic to Md — (D . The eigenvalues and singular values 

^ ,2 

of T are understood in terms of the matrix representation T e M.d'^ of T acting on (D . 
The matrix representation of a quantum operation, which will always be written with a hat 
(ex. T), is given in terms of some complete orthonormal basis {-Fi}i=i...rf2 of M.d, where 
its matrix elements are T^ = {Fi\T\Fj). Unless otherwise specified, we consider the basis 
of matrix units. The distance between states pi, P2 G Sd will be measured in terms of the 
trace distance \\pi — pi\\\ induced by the trace norm ||A||i = tr[vAl\A] = ^^ Si{A) for 
Si{A) the singular values of A G M^d- Time discrete quantum Markovian dynamics are de- 
scribed by completely positive, trace-preserving maps (cpt-maps, or quantum channels) H 
T : Jvid !—> M^d- Due to the Kraus representation theorem, every cp-map can be expressed 



in terms of the Kraus operators A^ £ Md, as 

T(p)-^A^p4 with, ^4^^ = 1, (5) 

The dual map T* can be seen as the hermitian conjugate of T with respect to the Hilbert- 
Schmidt scalar product. In the above matrix representation of the map, this corresponds 
exactly to taking the hermitian conjugate T^. A quantum channel is called unital, or doubly 
stochastic, when the dual map T* is also trace preserving. This immediately implies that 
T has a fixed point a — l/tr[l]. We will also consider time continuous quantum Markov 
processes, described by a one-parameter semi-group 

r,(p)=e*^(p) (6) 

The LiouvilUan C, obeys £*(1) = flU], and its matrix representation C is obtained as in 
the time-discrete case. In this article, we shall often consider primitive quantum channels, 
i.e. channels with a unique maximal rank fixed point, and a unique eigenvalue of magnitude 
id. 

II. THE QUANTUM x^-DIVERGENCE 

We want to define a generalization of the classical x-^ -divergence to the case when we are 
working on spaces with non-commuting density matrices. We shall require that any gener- 
alization to the setting of density matrices satisfies the condition that when the inputs are 
diagonal, the classical x^ -divergence is recovered. The first observation we make, reading 
straight off from (|2]), is that the classical x^-divergence can be seen as an inner product on 
the probability space weighted with the inversion of the distribution qi. Due to the non- 
commutative nature of density matrices there is no unique generalization of this inversion. 
Consider for instance a generalization for two density matrices p,a £ Sd, where for now we 
assume a to be full rank, that is given by 

Xi(p,a)=tr[(p-a)a-"(p-a)a"-i]. (7) 

This gives rise to an entire family of x^ -divergences with (as we see below) special proper- 
ties, for every a G [0, 1]. The natural question of whether there exists a classification of all 
possible inversions of cr, was investigated in a series of papers by Morozova and Chentsov 
ll30ll Petz Ill3l4l5ll . in the context of information geometry. They considered the characteriza- 
tion of monotone Riemannian metrics on matrix spaces. Their general definition is based on 
the modular operator formalism of Araki lUTl II8II , which we will also consider here. In order 
to classify the valid inversions, we first need to define the following set of functions, each of 
which gives rise to a possible inversion: 

/C = {fc| — fc is operator monotone, k{w^^) = wk{w), and fc(l) = 1}. (8) 

Now, we define left and right multiplication operators as Ly{X) = YX and Ry{X) = 
XY respectively. The modular operator is defined as 

Ap,, = LpR-\ (9) 

for all p,(J £ Sd, (T > 0. Note, that i?(j and Lp commute and inherit hermicity and positivity 
from p,(T. The above should be read as follows: acting on some A E Md, Ap^a-{A) = 
pAa~^ . When manipulating the modular operators it is often convenient to write them in 
matrix form, in wich case, they read: h.p^a\A) = p ® 'W^^\A), where \A) ~ A® t\I), and 
|/) = X)i=i N) corresponds to d times the maximally mixed state. This formalism gives rise 
to a more general quantum x^-divergence. 



Definition 1 For p,(t E Sd, and k E JC we define the the quantum y^ -divergence 

xl{p,a) = {p-<7,n',{p-<j)), (10) 

when supp{p) C supp{a), and infinity otherwise. The inversion inversion of a is defined 
only when supp{p) C supp{a), and given by 

^l - i?-ifc(A,,,). (11) 

The functions ka{w) = ^ (w^" + w"^^) yield the family of Xq -divergences given in (|7]l 
which we call the mean a-divergences to distinguish them from the well-known family 
of WYD a-divergences, which are described in in Appendix |A] along with several other 
families. Although we focus on the family dTJ, most of our results hold for any divergence 
given by ( fTOl l with k E K, with the exceptions of Theorem [l4l 



A. Monotone Riemannian metrics and generalized relative entropies. 

This definition of the x^ -divergence stems from the analysis of monotone Riemannian 
metrics. By Riemannian metric, we mean a positive definite bilinear form Ma-{A,B) on 
the hermitian tangent hyperplane TV = {A E Aid ■ A = A\ti[A] = 0}. The metric 
is monotone if for all quantum channels T : Aid >-^ Aid, states a E S^ and A E TV, 
Mr(cr)(T(^),r(A)) < M„{A,A). Petz showed showed that there is a one-to-one corre- 
spondence between the above metrics and a special class of convex operator functions, which 
correspond to 1/fc in our notation. He furthermore was able to relate several generalized 
relative entropies (which he defined much earlier Ill2ll and referred to as quasi-entropies) to 
monotone Riemannian metrics [14, 15]. The reverse implication, that every monotone Rie- 
mannian metric stems from a generalized relative entropy was first proved by Lesniewski and 
Ruskai jlql . Taking advantage of the well-known integral representations of operator mono- 
tone and convex functions [19] one can express the x^-divergences as well as the relative 
entropies explicitly. We shall briefly repeat the key points of the analysis that are necessary 
for our understanding of the mixing-time and contraction analysis for cpt-maps. 
We need to consider the class of functions Q by which we denote the set of continuous oper- 
ator convex functions from IR+ to M that satisfy g{l) = 0. Note that these functions can all 
be classified in terms of the integral representation: 

giw) = aiw - 1) + 6(w - 1)^ + c ^""'^^ + / ^"^ ~ ^^ du{s), (12) 

w Jo w + s 

where a,b,c > and the integral of the positive measure di'{s) on (0, oo) is bounded. The 
generalized relative entropy for states p,a E 5j" was first defined in ll20ll2l|] . 

Definition 2 Let g E Q. The generalized quantum relative entropy is given by 

Hg{p,<j)=tT[p^/'g{A,^,){p'/')] (13) 

when supp{p) C supp{a), and infinity otherwise, and where Ap g. is again the modular 
operator 

We now recall without proof a theorem [W-U&j relating the relative entropy and the monotone 
Riemannian metric, mirroring the classical result ©I 



Theorems For every k € IC, there is a g £ Q such that for a given a G Sd, and A, B 
hermitian traceless, we get: 



M^{A,B) = —l^Hg{a + aA,a + l3B) 



(14) 



a=/3=0 



{A,nl{B) 



and, k is related to g by 



g{w)+wg{w 1) 

[w — ly 



From this theorem follows a convenient integral representation of the inversion J7^, which is 
equivalent to (HB ifiell . 

where A'g denotes the singular measure A'g(s)(is = [hg + Cg)5{s)ds + dvg{s). Note, that the 
relationship between k and g is not one-to-one. Indeed, by setting g{w) = wg{w^^), we get 
back the above relation. However, there is a one-to-one correspondence between each k and 
a symmetric gs{w) ~ g{w) + wg{w^^), and hence between each metric and a symmetric 
relative entropy. 
Note that the a-subfamily of ^ has the associated symmetric relative entropy: g^y"^ (x) = 

^i^^ (w"-i + w-°'), SO that 

Hr\P,'^) = \{H^{p,a)+H^{a,p)) (17) 



where, 



H^{p, a) = tr[p^-"CT"-^ + p^+"cr-" - 2p"CT^^"]. 



The integral representation ( fTSI l of the inversion fJ^ allows for a partial ordering of different 
monotone Riemannian metrics that follows from the set of inequalities: 

*> 1 + s, 1 1, x + l 

<^^(^ + ^)<^^- (18) 



a; + l 2 s + X sx + 1 2x 

for s G [0, 1], and x G R^. We therefore see that there exists a partial ordering for the 
inversions, with a lowest and highest element in the hierarchy. The lowest element gives rise 
to the so called Bures metric. Thus, 

^Bures ^ 2(i?^ + i,)-l < (l^ < [L-^ + R-^)/2 = fl^" (19) 

The x^-divergence is recovered from the metric upon setting x|(p, a) = M^{p ~ a,p ~ a). 
We are therefore left with a partial order for all possible x^-divergences with a smallest and 
largest element according to, 

X^iures (P,0-)<Xfc(P,CT)<xLo(P'Cr). (20) 

The defining attribute of the above set of metrics is their monotonicity under the action of 
quantum channels. This was first shown by Petz 1151 . and later a proof based on the integral 
representation of fij (fT6] l. and on Schwarz-type inequalities, was provided by Ruskai and 
Lesniewski in lll6ll . Due to its importance for the mixing time analysis we shall repeat it here. 



Theorem 4 For all a G Sd, M^ is monotone under the action of a quantum channel T : 
Md — > Md for all k £ K. and A e Md, i-e. 



M^{A,A)>M^^^^{T{A},T{A)) 



(21) 



Proof: The monotonicity follows immediately from the integral representation of the inver- 
sion 57^ in (fT6] l. and an argument proved in the appendix. Theorem |2T| The theorem states 
that for every channel T and for arbitrary A, we have 



tr 



At- 



Rr, 



sL„ 



-A 



tr 



TMt- 



Rn 



sL„ 



-A 



> tr 



r(A)t 



R 



na) 



sL 



■T{a) 



-T{A) 



(22) 

n 



B. Properties of the quantum x -divergence 

The fact that the quantum x^-divergence can be used to bound the mixing time lies in the 
following Lemma, that upper bounds the trace distance which is the relevant distance measure 
in the mixing time definition. 



Lemma 5 For every pair of density operators p, cr € Sd, we have that 



(23) 



Proof: If the support of p is not contained in the support of a, then the right hand side 
is cx). We can therefore assume w.l.o.g. that cr > by restricting the analysis to the sup- 
port space of (7. The trace norm ||A||i of some matrix A E Md can be expressed as 
ll22ll ||A||i = max[/g[/((^') tr[t/A], where the maximum is taken over all unitaries acting on 
the d-dimensional Hilbert space. Thus, for any inversion J7^: 



l^ll? 



max tr[[/Al^ 

U<£U{d) 



max tr 

ueu{d} 



C/[f}^]-i/2„[f7^]i/2(yl) 



= max tr 

ueuid) 



[n, 



fel-l/2 



{U)[n';]'/\A) 



(24) 



< tr[Atf7j(A)]^max^tr[C/t[f7^]-i(C/)] 



Let us consider the Bures inversion given by il 
-1 



Rn 



is [f7f '"^'=''] ' = i [L, 



2[L. 

yu, 

tr [L/t[r!f"'^^^] -!([/)] = ^ {tr[U^aU]+tT[U^Ua]) = 1 



Ra]- Therefore, for any unitary U, 
1 



Clearly, its inverse 



(25) 



Setting A — p ~ a and observing that Xsures — xt for all k £ IC completes the proof. D 
We are also able to bound the relative entropy in terms of the x^ -divergence. 



Proposition 6 For every pair of density operators p,(T € Sd, we have that 

S{p,(t) <xlip,cr), 
where S{p, a) = tr[p(logp — logir)] is the usual relative entropy. 



(26) 



Proof: In l'26'l, it was shown that for any 7 G (0, 1], 

5(p,a)<i(V+V-^-l) (27) 

7 

Now, consider the basis in which a is diagonal, and write a — X^jt M'i;|^)(^l '^'^^ P = 
Efci''fenfc)('l-Then, 

= 2 5^ ^:^ - ,,, - 2(4£ - r,,) (28) 

where the penultimate inequality is obtained by rearranging terms and by dropping the posi- 
tive terms Yl,ki TkiTikl^m + A*fc) with k^l. D 
We have already stated that the family of Xq -divergences defined in (|7]i can be cast into 
the general framework of monotone Riemannian metrics. Because of its computational sim- 
plicity, and its special symmetry when a = 1/2, we consider its properties more specifically. 
It is possible for instance to show monotonicity of this subfamily using arguments from joint 
convexity. As the proof is interesting in its own right, we give it here: 

Proposition? Xq is jointly convex in its arguments for a G [0, 1]. Moreover, it is monotone 
w.r.t. completely positive trace-preserving maps, i.e., 

xUp,<j)>xUTip),T{<j)), (29) 

for every quantum channel T : Add ^ Md- 



Proof: A direct application of Corrolary 2.1 in 02711 guarantees that Xq(Pi cr) is jointly con- 
vex in its arguments for any a £ [0,1]. This in turn implies monotonicity w.r.t. cp-maps by a 
standard argument: let us represent T as T{p) = tr^ [U{p (8) V')^^^] where ijj is a pure state 
(i.e. rank-one projection), U a unitary and tr^ the partial trace over an 'environmental' sys- 
tem of dimension rn. If we take a unitary operator basis {Ki}i=i....m2 in Mm (orthonormal 
w.r.t. the Hilbert- Schmidt inner product), we can write 

_. m 

T{p) 1„/to = — ^(1 ® Vi)U{p ® i>)U\l ® V^). (30) 

i=l 

However, since Xq {T{p), T{a)) — Xa (^(p) '^ '''i '^(f) "^ ''') in particular for r = Im/m, 
we can now apply joint convexity. With the help of the fact that for any unitary W it holds 
that (W ■ W'^)"' = W{-)"'W^ we obtain the claimed result. D 

Furthermore, we note that this subfamily also has a natural ordering. 

Proposition 8 For every p,cr £ Sd, Xa '* convex in a, and reaches a minimum for a = 1/2. 

Proof: First note that Xq=o(P'''') — Xa=i{P^'^)- That the minimum is reached for a — 1/2 
follows directly from the Cauchy-Schwarz inequality. Applied to our problem we get 



tr 



pa-i/2pa-i/2 



2 



tr /,<7("-i)/V-"/2pa("-i)/V-"/2 

l2 



2 



(31) 



< tr[pa-" pa°'-^]' 



To see convexity, consider the second partial derivative of Xa with respect to a: 

= E^rVr(logMfc-log/iO'l(fc|plOI'>0 (32) 

M 

where we used a = ^^/Ltfc|fc)(fc|. D 



III. MIXING TIME BOUNDS AND CONTRACTION OF THE x^ -DIVERGENCE UNDER 

CPT MAPS 

A. Mixing time Bounds 

The x^-divergence is an essential tool in the study of Markov chain mixing times, because 
on the one hand it bounds the trace distance, and on the other it allows easy access to the 
spectral properties of the map. The subsequent analysis can be seen as a generahzation of the 
work presented in y^^J to the non-commutative setting. 

Theorem 9 (Mixing time bound) Let T : Md i— )■ Md be an ergodic quantum channel with 
fixed point a G Sd, for any p Cz Sd and any k G K., we can bound 

\\T-{p)-<j\\,,<(s'X^fdM- (33) 

Here s^ denotes the second largest singular value (the largest being 1) of the map 

Qk^[n';]'/^oTo[n';]-'/^ (34) 

Before we prove theorem (|9|l, we would like to point out an important fact that regards the 
the singular values of Qk- The monotonicity of the x'^-divergence ensures, that the singular 
values s^ of Qk are always contained in [0, 1] irrespectively of the choice of k E K,. Let us 
therefore prove the following: 

Lemma 10 (spectral interval) The spectrum of the map Sk = Qt-o Qk ~ [rij]^^/^ oT* o 
n'^oT o [Jl^]^^/^ is contained in [0, 1]. 

Proof: Let us first note, that the map Sk is hermitian and positive by construction. Fur- 
thermore, the monotonicity of the x^ -divergence, as stated in theorem (|4|i ensures that the 
Rayleigh-Ritz quotient is bounded by 1. This holds, since \/B 

{B,SkiB)) - {A,T* on';oT{A)) = a4^^){T{A),T{A)) < (35) 

M^iA,A)^{A,nUA)) = {B,B), 

where we defined the intermediate state A = [n,'^]~^/^{B). Note that we made use of the 
fact that (T — T{a) is the fixed point of the map. Therefore 

{B,SkiB)) 
^max = max — — < 1 (36) 

BeMd {B,B) 

and the maximum is attained for X„iax = 1 and B^ax = ['^aV^'^i'^)- n 



With the bound on the spectrum at hand, it is now straight forward to prove theorem Q 

Proof: Define e{n) e Md, as e(n) = ^"(p - a). By Lemma|5] we get ||e(n)||f < 
X^(r"(p),r"((T)) = Xfe(")- In the matrix representation, |e(n)) = e(n) (g) 1|/), we can 
rewrite Xfe("-) = (e("-)| f^^ k('^))- Note that also, \e{n + 1)) = T\e{n)) and so, 

Xl(n) -xlin+l) = (e(n)l f)^ |e(n)) - {e{n)\f^ Cl'; f|e(n)) (37) 

= (e(n)l [f7^]i/2 (l - Q,^Q,^ [n';]'/'\ein)). (38) 

Due to lemma (fTOb we know that the spectrum of 5^ = Qk Qk, which is equal to the square 
of the singular values of Qk, is contained in the interval [0, 1]. Hence, 

{einm^,]'/^[l~Sk)m'/Mn)) (39) 

> il-sl){einm',]'/'Y.P'^ P']'^Mn)). (40) 

The sum is taken over spectral projectors P^' of Sk — X^qC'^q)^^"' ^P^rt from Pq 
which projects onto [n'^]-^^^\a). In particular, P^ = [n'^]-^^^\a){I\[n'^]~^^^, so that 

(e(n)|[J7j]i/2Po'= [r2^]i/2|e(n)) = (e(n)|o-)tr[r"(p - cr)] = 0, by trace preservation of 
T. We can write, 

X?-H-X^(n + l)>(l-(.tf)x^(n). (41) 

Rearranging terms completes the theorem. D 

Remark: The fact, that the singular values of Qk are always smaller or equal to one 
justifies the use of the generalized x^ -divergence as the appropriate distance measure to 
bound the convergence of an arbitrary channel. It is tempting to use the Hilbert-Schmidt 
inner product to give an upper bound to the trace norm. This can always be done at the 
cost of a dimension dependent prefactor, since on finite dimensional spaces all norms are 
equivalent. However, when doing so a problem arises if one tries to bound the convergence 
in terms of the spectral properties of the map Shs — T* o T. It is in general not ensured 
that the spectrum will be bounded by one. In fact, for every non-unital channel T, Shs will 
have an eigenvalue larger than one ||3 Il1 . The similarity transformation of the channel T with 
jj^fcji/2 ^jfgj-g (jjg singular values, but of course leaves the spectrum invariant. Furthermore, 
it is a well known fact [22] that the singular values of a square matrix log-majorize the 
absolute value of the eigenvalues. As the spectrum of Qk is bounded by one (and equal that 
of T by similarity), we conclude that its second largest eigenvalue is always smaller or equal 
to its second largest singular value. One can also give a general bound in terms of the second 
largest eigenvalue of T 1321 . but one is then confronted with a potentially severe dimensional 
prefactor. 

For some instances of the inversion fjj it becomes immediately evident that the sym- 
metrization Sk has the desired spectral properties without making use of the monotonicity 
of the x| -divergence. It can occur, that Sk is again similar to a quantum channel that is 
of the form T^ = [Q'^]^^/^ o Sk o [fl^Y^^. A possible example of such an inversions is 

fla~ = La Ra ■ This is however not the generic case, most inversions will lead 
to maps that are not completely positive any longer. It would be very desirable to find 
other such examples, as they mirror the classical situation where the symmetrized maps are 



10 



always probability transition matrices, and because these specific inversions allow for clean 
contraction bounds as seen in section lll.B. 



It is clear from the discussion above that the singular values of Qk play a crucial role in the 
mixing time analysis presented here. This seems to contradict the general understanding that 
the convergence is determined by the spectral properties of the channel T in the asymptotic 
limit. This can however be understood as follows: the matrix Qk is similar to T, i.e. Qk = 
|-j=jfeji/2. ji.|-j^fcj-i/2^ so the spectra of Qfc and T coincide. The following lemma establishes a 
relation between the singular values and the eigenvalues in the asymptotic limit. For a proof, 
see e.g. B pg.180. 

Lemma 11 (Singular values) Let Qk e Md^ be given, and let so{Qk) > ■■ ■> ScP-i{Qk) 

and {\i{Qk)}i=Q...d?-i denote its singular values and eigenvalues, respectively with 
|Ao(Qfe)|> ...> |A<i2„i(Qfe)|. Then 

\iu,[s,{Ql)Y'^ = \\{Qk)\ V* = 0...d2_l (42) 

In the limit of n — > oo applications of the quantum channel, we can start blocking the channel 
in TO subsequent applications T^™) = T™ and bound the convergence rate as a function of 
the singular values of the corresponding Qj," , which indeed converge to the eigenvalues of 
the original cp-map . Convergence following the eigenvalue is therefore only guaranteed in 
the limit of n — > co, and this would indeed be the case, when e.g. the eigenstructure of the 
original cp-maps contains a Jordan block associated to the second largest eigenvalue. Note, 
that convergence in the above lemma goes typically as 1/n, which is very slow. Hence for 
finite n, convergence is governed by the singular values of Qk as opposed to the eigenvalues. 
The bound derived in (|9]l is an absolute bound for finite n and clearly leads to a strictly mono- 
tonic decay. Note that in the case that the second largest singular value is also equal to 1, this 
can then always be cured by blocking the cp-maps together Finally, it is worth mentioning 
that the convergence can in fact be much more rapid if one starts in a state "closer" to the 
fixed point. In particular, if the initial state is such that p — a oc Yk, k > 2, where Yk is the 
eigenvector corresponding to A^, then the convergence will be governed by the magnitude of 
Afc. Furthermore, if instead of a single fixed point, we have a fixed subspace, or a collection 
of fixed subspaces (with or without rotating points), then the convergence to this fixed sub- 
space will be governed by the largest eigenvalue whose magnitude is strictly smaller than one. 

Thus far we have only considered the time-discrete case, it is however straightforward 
to give a similar bound for time-continuous Markov processes, that are described by a one 
parameter semi-group. The following lemma bounds the trace-distance as a function of i € 
]R(|: The proof of the following lemma is very similar to the proof of the time discrete case, 
we will therefore omit it here. 

Lemma 12 (Time-continuous bound) Let C denote the generator of a time continuous 
Markov process, described by the master equation dtp — C{p), with solution pit) £ Sd 
V t G [0, +oo) . Furthermore let a G 5j" denote the fixed-point C{<t) — 0, then 

\\pit)~a\\l<e''^'xlip{0),a). (43) 

Here, li < refers to the second largest eigenvalue of 

Afe = [rj^]i/2 o c* o [n^,]-^/^ + [n'^,]-^/^ o c o [n^i^/^. (44) 
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The symmetrization for the generator of the time continuous Markov process is additive as 
would be expected. Furthermore, we note that the monotonicity of the x^ -divergence ensures 
that the spectrum of A^ is never positive, based on a similar reasoning as given in lemma 

Coll. 



B. Contraction Coefficients 

In the following we study the contraction of the x^ -divergences under quantum channels, 
and its relation to the trace norm contraction. We consider general contraction rather than 
contraction to the fixed point because analytic results are more readily available, and because 
these bounds are in a sense the most stringent one can require. We focus primarily on the 
mean a-subfamily of x^-divergences. 

Let us define the following contraction coefficients which we call the x^- and trace norm- 
contraction respectively: 



^^^ xlinp),T{a )) 



Vx (^) = sup TfT:r\ "^^S) 



and 



VUT)^ sup M^^^= sup i||T(V.)-r(0)||i, (46) 

where T : Aid -^ A^disa quantum channel, and the last equality is seen simply by 
convexity of the trace norm. 

We first upper bound the trace-norm contraction in terms of the x^ contraction, which is a 
generalization of a result in ||25I1 : 

Theorem 13 For all a G (0, 1], and a quantum channel T : Aid ^ Aid, 

VtriT) < ^Tj^iT) (47) 

Proof: From Lemma|5] we have that \\T{p - a)\\l < xl{T{p),T{a)), for all p,a e Sd- 
Let N be traceless and hermitian, and note that it can be written as iV = N^ — 7V_, where 
N^, iV_ are positive definite and orthogonal in their support. Now let P = |Af|/||iV||i and 
recall that \N\ = N+ + 7V_, then we get ti-[NP-°'NP°'-^] = \\N\\l, for every a e (0, 1]. 
Also, 



\\T{N)\\l ^ tT[T{N)T{P)-°'T{N)T{P)°' 
II trlNP-^'NP" 



2 < ' ' :,r:.^_^.;^^.; ' — - m 



where the inequality is in the numerator, and the denominators are equal, by the previous 
observation. Taking the supremum over all traceless hermitian N on the left hand side and 
identifying p — a — N,P — a then gives desired result. D 

We now provide a lower bound to the trace norm contraction for primitive channels: 
Theorem 14 Given a quantum channel T : Aid ^ Aid, 

„=i/2(y) < rjtriT) (49) 
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First we introduce an eigenvalue type min-max characterization of the x^ -contraction, and 
then show that this eigenvalue must be smaller than the trace norm-contraction. 
Let P > 0, and consider the following eigenvalue equation: 

r\A) = Clp'fmT^p)t\A) - X\A), (50) 

where ^x = ^'x~ • It T has a non-trivial kernel, then flTiP) should be understood 
in terms of the pseudo-inverse. First note that F is a quantum channel, so its spectrum is 
bounded by one, and that it reaches one for A = P. Also note that F is similar to a hermitian 
operator, so it has all real eigenvalues, so we can take the eigenvectors to be hermitian. Then 
rewriting ( fSOb as T^Clj^(^p-^T\A) = XClp\A), we can express the second largest eigenvalue as: 

, ,^ p, (JV|ftA f|JV) 
Xi(i,P) = sup 

{N\np{P)}=o,N=N-t {N\^p\N) 

tr[T{N)T{P)-^/^T{N)T{PY^''^] 
~ ,.N=Z=m tr[7VP-i/2iVP-i/2] • ^51) 

1 /'? 
Clearly, by maximizing over all P, one recovers 77^' (T). We now prove the above theo- 
rem: 

Proof: Let Ni be the eigenvector for which Ai satisfies the eigenvalue equation dSOl l, and 
recall that A^i is Hermitian and traceless. Then, 

AillTVilli = ||F(r(iVi))||i < ||T(Ari)||i (52) 

because F is a channel, and 

^ ||r(iVi)||i ^ \\T{N)\\, 

Ai < ^ — < sup ii^ii = mr, (53) 

||iVi||i tr7V=0,Af1=W IH^IIi 

taking the supremum over positive P completes the proof. D 

Remark: Theorem [14] gives a computable lower bound to the trace norm contraction. A 
key subtlety in the argument is that [57p(j4)] ~ ^/PA\/P is a completely positive (CP) 
map (with a single Kraus operator \/P) which implies that F is a quantum channel. In 
general, ilp is not even positivity preserving. Another exception is the monotone metric 
associated with the usual logarithmic relative entropy for which k{w) — -^j^- It is well- 
known iHlllllTtl that rtp^iA) can be written as 

f]'°s(A) = / A dx (54) 

P ^ ' Jn P + Xl P + Xl ^ ^ 



which is clearly CP. An analogous lower bound was shown in 111611 for this map using a 
similar argument. Clearly, this can be extended to any monotone metric for which flp is CP; 
however, we do not know of any other examples. 

Very little is known about the ordering of the general rjk contraction coefficients. In par- 
ticular. We do not know whether whether tj^°^ is smaller or larger than 77^" ' . However, it 
is known lllql that 7]k are not all identical for different k <E K..; because examples can be con- 
structed using non-unital qubit channels. Theorem[T3]can readily be extended to any metric 
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associated with k E IC. However, it seems unlikely that Theorem[T4lholds in general,. Thus, 
we can conclude 

max{7y^=i/2(r), ,7x'(T)} < ^,,{T) < inf ^/^) . (55) 



Note that if instead of maximizing over all P we only consider contraction of the map 
to the steady state, and denote it fj{T) = ri{T)p=cr, then from the above arguments one 
immediately gets: 

fj'^iT) < vUT) < r,tr{T) < 1 (56) 

Combing this with the previous bounds above, we have 



Ai < sr'^' = f^i=^i^ < vT'^' < ^, < v^r'/'- (57) 

Moreover, k{w) — y^ on the right can be replaced by any k G K., and that on the left 
by k{w) = {w — 1)~^ logui. It is very tempting to conjecture that fj^j. < f]" and/or that 

Vtr < y Vx~ ' tiut simple numerical counterexamples show these to be false. 



IV. QUANTUM DETAILED BALANCE 

The detailed balance condition is often crutial in the analysis of classical Markov chain 
mixing times, as it ensures several convenient properties of the Markov chain. In particular, 
it implies that the classical probability distribution with respect to which the stochastic map 
is detailed balanced is a fixed point of the chain. Furthermore, detailed balanced stochastic 
maps have a real spectrum. In this section we generalize the notion of classical detailed 
balance to quantum Markov chains. Alternative definitions of quantum detailed balance have 
been given in the literature: i28ll33ll35Ll36ll and references therein. Central to our approach is 
the operator Qk as previously introduced in Lemma|9l In the literature for classical Markov 
chains an analogous matrix exists and is often referred to as the discriminant. 

Definition 15 For a channel T : A4^ -^ Md and a state a G iSJ" with corresponding 
inversion f2^ as defined in ( 1771 ), we define the quantum discriminant ofT as, 

Qk^[nlY/^oTo[nlr^/\ (58) 

We recall that the convergence of an arbitrary quantum Markov process can be bounded by 
the singular values of Qk- Classical detailed balanced Markov chains have the property that 
the corresponding discriminant becomes symmetric. We shall therefore define the quantum 
generalization by requiring that for a quantum detailed balanced process 

Ql = Qk- (59) 

This immediately allows to make a statement about the spectrum of quantum detailed bal- 
anced maps. Due to the hermicity of the matrix representation of the map ( |58] ) we can imme- 
diately deduce, just as for classically case, that the quantum channel T has a real spectrum. 
For detailed balanced maps, the second largest eigenvalue in magnitude coincides with the 
second largest singular value. Furthermore, we would like to point out that this is actually 
not just a single condition for quantum detailed balance but a whole family. Hence every 
different inversion ilj gives rise to a different condition for detailed balance. We therefore 
define as the quantum generalization of detailed balance: 
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Definition 16 For a channel T : M.d ^ M.d and a state a G 5j", we say that T obeys k- 
detailed balance with respect to a with k G K. , when 

[f]fe]-ior* = ro[f7^]-i. (60) 

A consequence of this definition is that cr is a fixed point of T. 

Lemma 17 Let a ^ Sdbe a state and T a channel that satisfies the detailed balance defini- 
tion\l6\with respect to fl^, then a is a steady state ofT. 

Proof: Recall that the inverse is given by [r^^]^^ = Rcrl{^a,a), where J{w) — \/k{w). 
Hence, since fc(l) = /(I) = 1, we have 

[0^]-i(l)=i?,/(A,,,)l = i?,l = a. (61) 

Now, since furthermore T*(l) = 1, we have that 

na)^To[nlY\l)^[nlY\T*{l) = [nl]-\l) = a. (62) 

D 

Given a probability distribution on some set of states, it is desirable to have a simple criterium 
to check whether a completely positive map obeys detailed balance with respect to the state 
generated from the distribution. This criterium may then serve to set up a Markov chain that 
actually converges to the desired steady state. 

Proposition 18 Let {\i)}i be a complete orthonormal basis of Sj and let {ni}i be a proba- 
bility distribution on this basis. Furthermore, assume that a quantum channel T obeys 

"" 't\T{\n}{m\)\j}= ,/' ^ {m\T{\j}{t\)\n}, (63) 



then a ~ ^^ /^i|*) (*| and T obey the detailed balance condition with respect to 17^. 

Proof: Note that {|«)0|}ij forms a complete and orthonormal basis in the space Aid with 
respect to the Hilbert-Schmidt scalar product. We can therefore express equation (|60] | in this 
basis. The individual entries are equal due to 

tr[(|m)(n|)^[f7^]-ioT*(|j)(z|)] = a^„ fc-^ (a.,„/m„) tr [T( |m)(n| )t (|j)(*|)] = (64) 
Hn k-^ (MmMO (J| T{ \n){m\ ) \j) = ^i, k^^ (a^jM) (™I T{ \j){i\ ) \n) = 
M. k-' (M,M)tr [(|m)(n|)^T(|j)(z|)] = tr [{\m) {n\)^ T o [^l]-\\j) (^\) . 

n 

Remark: We note that the different quantum detailed balance conditions coincide for clas- 
sical channels, i.e. for stochastic processes that are included in the framework of quantum 
channels. Define the following "classical" Kraus operators: 

^fj^ ^/pij\^i){j\ and a state, cr = ^ /x,|i)(i|. (65) 



In this case, the condition of Proposition [18] reduces to the classical condition. This can 



be seen when considering the channel T'^^{p) = ^^ Af.pAl.^ and checking for detailed 



balance with respect to sigma, since 



-^ {i\ T-\ \n){m\ ) \j) = ^-^ii!^5„„<5,^.p„ 



k {Pn/l^m) k {p-n/fJ-r. 
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and 



Hi 



k{^j/lii 



mT^^{\n){m\)\j) 



k{fXj/fi,) 



(66) 



However since fc(l) = 1 we are just left with the classical detailed balance condition 
HiPni = ^J.„Pin for all pairs i, n. 

A natural question to ask is therefore, whether the different detailed balance condition 
are all identical. To see that this is not the case, consider the example given by the Kraus 
operators of a single qubit, i.e. ^ — (D^, 



This channel has the unique fixed point 



and Ao — — 



1 /5 1 



1/1-1 



1 -1 



'"^eVi 1 



(67) 



(68) 



From this channel it is now possible to construct a channel that obeys detailed balance 
with respect to the inversion given by choosing k{w) = w^^^^, that is the inversion reads 



ri" ' ^ La Ra ■ We consider therefore the symmetrized map. 



T, - 



^a = l/2 



. T* O r2" = l/2 o rj._ 



(69) 



For the specific instance where fl 



a=l/2 



is given as above, we are assured that the map Ts 



is again a quantum channel, because one immediately finds the Kraus representation for 

indiv 

V2 



Ts{p) — X^i, ^ijP^ij ^s Bij = \/^AI [^/a] ^Aj. The individual Kraus operators read. 



R -3f 1 1 

^" ~ 5 1^ 1/2 1/2 

_ v2 f 3 3 
-1 -1 



and Bi2 



1 -1 



5 11/2 -1/2 



(70) 



■^21 - lo 



1 

and B22 = - 



-1 1 



a=l/2 



The channel Tg satisfies detailed balance with respect to Qa by construction. This 

channel however does not satisfy detailed balance with respect to the inversion fj^"''^'' = 
2 [Lfj + Ra] as can be seen directly by evaluating the detailed balance condition in terms 
of the matrix representations, 

-1 



n 



Bures 



■ T t - T, • 



n 



Bures 



^'--^i^^Y + Y^M, 



where 



Y = 



-1 

1 



(71) 



(72) 



The family of quantum detailed balance conditions is therefore much richer than the classical 
counterpart. 
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V. QUANTUM CHEEGER'S INEQUALITY 

In the context of classical stochastic processes a very powerful formalism has been devel- 
oped, often referred to as the conductance bound or Cheeger's inequality, to bound conver- 
gence rates of stochastic processes. We will generalize this to the quantum setting in this 
section. Similar results have appeared in 02911 . The gap of the map Sk is defined as the dif- 
ference between the largest and second largest eigenvalue, A = 1 — Ai. The gap can be 
characterized in a variational fashion f22^ . 

Proposition 19 The gap of the map Sk = [^^V^^'^ oT* on^oTo [n^]-^^^ is given by 

{X,{id-Sk)X) 



^e*^^ \\\{X®^-^®X) 



-^^ (73) 

\hs 



where \\A\\^g = tr[A' A] denotes the standard Hilbert-Schmidt norm and ( ,) the corre- 
sponding Hilbert-Schmidt scalar product. 

Proof: The eigenvector that corresponds to the eigenvalue Aq = 1 of Sk is given by ^/ct. 
The gap can therefore be written as li22il : 

^ - mm 1 - ^ ^ ^' 



X6Afd;tr[XV^=0 tr[XtX] 

ti[X^{X-S{X))] 



A'eMd:tr[Xv^=o tr[XtX] - tr[X y/o]'^ 
tr[XHX - S{X))] 



(74) 



mm 

xe 



^^'i i||(X(g)V5^-V^®x; 



,2 ' 

\hs 



Note that the constrained tr[Xy^ = can be dropped in the last line. Suppose that 
tr[Ar-y/CT] = c, we can then define X' = X — C\fa and vary X' since the equation is in- 
variant under such shifts. D 

Throughout the remainder of this section we consider unital quantum channels, i.e. maps 
which obey T(l) — 1. For this case it is ensured that already the simple map S = T* o T 
has a spectrum that is contained in [0, 1], since all fi^ coincide and correspond to the identity 
map. The x^ -divergence just reduces to the standard Hilbert-Schmidt inner product times a 
prefactor given by d = dim(io). In the case of a detailed balanced stochastic map it even 
suffices to just consider the map itself. In either case we will denote the corresponding map 
as S from now on. The variational characterization of the gap A now allows us to give an 
upper as well as a lower bound to the second largest eigenvalue of S. 

Lemma 20 Let T : Add -^ Add be a unital quantum channel. Then the second largest 
eigenvalue Ai of its symmetrization S — T* oT, is bounded by, 

1 - 2/i < Ai < 1 - i/i^, (75) 

where h is Cheeger's constant defined as, 

h= min trP-n,)^(n.)]^ ^^^^ 

nA,tr[n^]<d/2 trpA] 

The minimum is to be taken over all projectors Ha on the space A <Z S^, so f/iaf trp^i] < d/2. 
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Proof: An upper bound to the gap is immediately found by choosing X = Ha- Due 
proposition iT% we can write: 

^ ^ tr[n^(id-^)(n^)] 



tr[n^] - itr[n^]^ 
tT[{i-nA)S{nA)] 

itr[(l - UA)]tT[UA] 



< 2h, (77) 



where in the last line we have used that tr[l — Ha] > d/2. 

For the lower bound, we can restrict the minimization in ( |76] | to diagonal projections. Further- 
more, when considering only unital quantum channels, it is possible to reduce the problem of 
bounding the gap A to that of bounding the gap of a classical stochastic process. To see this, 
let us work in the basis where the eigenvector Xi ^ M.d corresponding to Ai is diagonal. We 
shall assume wlog that X\ = Xi. In this basis, we can write X — '^Xi\i){i\. The numerator 
then becomes 

tr [XHX S{X))] =J2x.x,itr[\z){z\\j){j\] ~ tr [N)(z|5(|j)01)] 

i ij ij 

We introduced the matrix Py = {i\S{\j){j\)\i), which is a symmetric non-negative matrix 
which obeys Pij > , J^i Pij ~ 1 ^i^^ P^ ~ P- Hence P is doubly stochastic. Performing 
the same reduction in the denominator we obtain 



l^\\{X®l-l®X)\\lg^^^ Y^{x,-x,f (79) 

ij 

Hence, we arrive at the classical version of Mihail's Identity yfl. 



A = min ^\' \ -^ . (80) 

{x,} 1/d l^,j[x,-Xjy 

Given the classical version of Mihail's identity, the proof of the lower bound is the same as 
in the classical case. For completeness we repeat it here. First, we define, Zi = \xi\xi and 
write, 

/ , Pij I ^i ~ ^j\ ~ / , Pij\\Xi\Xi ~ FjFil — / , V Pij V Pij \\Xi\ + l^iljl^^i ~ ^j) 
ij ij ij 



< /^P^,(x.-a;,)2 /^P.,(|x.| + |x,|)2, (81) 

V ij V ij 

where we used Cauchy-Schwartz in the last step. Consider now, 

^P.,(|a;.| + \x,\f = 2(^x2 + ^|x.|P.,>,|) < 4^|x,p. (82) 

ij i ij i 

Furthermore, note that we can bound, 

lldY^{x.~x,f<2/dY,xl^2Y,\z,\. (83) 
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We are therefore left with a lower bound to Mihail's identity, which holds for all choices of 

{Xi} 

1 (z-^ij ^ijl^i ~ ^j\\ ^ Z^ij J^^ijyXi — Xj) 



We shall now assume, that Xi > everywhere and we can hence drop the absolute values in 
the definition for the z^. This is assumption is valid since we are free in adding an arbitrary 
constant Xi -^ Xi + c to make all Xi positive. Note that we therefore are left with a lower 
bound to the gap of the form. 



(85) 




2 
Let's focus on the right side of the inequality. Since, 



2 J2 ^y(^?-2;|) = 4 ^ P,W tdt = 4 t Y, P,,dt, (86) 

ij : Xi>Xj i,j : Xi>Xj ^J ij : Xi>t>Xj 



and furthermore. 



ij : Xi>t>Xj ieA{t) jeA^it) 



we can bound. 



4 I t Y^ P,jdt>hAl i ^ e(i-x,)rfi = 2/i |^a;n , (88) 

•^0 ij:x,>t>Xj "^0 ieA{t) \ i J 

where we defined h as in the same fashion as above. We have therefore found the desired 
lower bound for the spectral gap of the map S. D 



A. Example: Conductance bound for unital qubit channels 

A convenient basis for the matrix space A^2 associated with the Hilbert space S) — (D^ 
is given in terms of the Pauli basis {l,ax,(Jy,az}- In this basis a density matrix p E S2 
can be parametrized in terms of its Bloch vector r e M'^. In the Bloch representation the 
density matrix reads p = -i (1 + r • S), where S = (cr^, ct^, CTz). It is also straight forward 
to determine the matrix representation of a quantum channel T : A^2 — >^ A^2 with respect to 
the Pauli basis. A general channel can be written as a matrix T G A^4. 

r.(jO). (89) 

The channel acts on a density matrix via T{p) = T(i (1 + r • S)) = 5 (1 + (t + Lr) • S). 
It can be shown, that the map T is unital if and only if t = 0. Let us now consider the 
optimization for Cheeger's constant h as given in Lemma ( l20l i. Given the constraint, we have 
to vary all one dimensional projectors Ha ~ \tp) {ip\ with |j \ip)\\2 = 1, so that 

h= min tr[(l-|V;)(^|)^(|V^>(^|)]. (90) 
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The symmetrized map S of the unital channel T, with t = 0, now assumes the matrix repre- 
sentation, 

1 



Furthermore note, that any projector |V')(V'I G '^2 can be parametrized via a Bloch vector 
a e TR^ that obeys ||a||2 = 1. The minimization for Cheeger's constant reduces therefore to 



mm 



1 - (a| LtL |a), (92) 



l|a||2 = l 

where (a|b) denotes the canonical scalar product in ]R^. The minimum is attained when a 
is the eigenvector associated with the largest eigenvalue sf of the matrix L^^L. Hence for an 
arbitrary single qubit unital channel, Cheeger's constant is given hy h = 1 — sf, where si is 
the largest singular value of the matrix L and hence the second largest singular value of the 
channel T. We see that the conductance bound as stated in ( |20l ) is indeed satisfied, since 

2sl-l<.sl<^{l + sl). (93) 



VI. CONCLUSION AND DISCUSSION 

We have seen that by generalizing the x^-divergence to the quantum setting, many of the 
classical results for the convergence of Markov processes can be recovered. The general 
perception, that the convergence should be governed by the spectral properties of the quan- 
tum channel could be verified in the asymptotic limit. The fact that we were working with 
non-commuting probabilities gave rise to a larger set of possibilities of defining an inversion 
of the fixed point density matrix, all of which give rise to a valid upper bound for the trace 
distance. An interesting question is how the different singular values s'H of the corresponding 
quantum discriminant relate to each other The generalization of the x^ -divergence also led 
to the definition of detailed balance for quantum channels. Again, not only a single condi- 
tion for quantum detailed balance exists but an entire family of conditions each determined 
by a different function k E JC, all of which coincide in the case when we consider classical 
stochastic processes on a commuting subspace. The quantum concept of detailed balance 
therefore appears to be richer and allows for a wider set of channels to obey this definition. 
The conductance bound that was derived could only be shown for unital quantum channels. 
However we would like to point out, that it is possible to give conductance bounds for classi- 
cal maps when the Markov chain is not doubly stochastic. The fact that in general we may not 
assume that the fixed point of an arbitrary channel commutes with the eigenvector associated 
to the second largest eigenvalues seems to hinder a generalization for non-unital channels. 
Moreover, the classical conductance bound has a nice geometrical interpretation in terms of 
the cut set analysis and the maximal flow on the graph associated to the stochastic matrix P^ . 
When investigating general quantum channels such a nice geometric interpretation seems to 
be lacking. For unital quantum channels Cheeger's constant can also be viewed in terms of 
the minimal probability flow of one subspace to its compliment. 
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Appendix A: Families of divergences 



The most widely used family of divergences, often called a-divergence 11371 Chapter 7], is 
associated with the functions 



k 



WYD 



(w) 



(l-w")(l-'u;i-") 
a(l-a)(l-w)2 



for 



ae [-1,2] 



(Al) 



This family is sometimes called the WYD divergences, because it arises from an extension 
of the Wigner-Yanase-Dyson entropy Olg, |40] associated with the (unsymmetrized) function 
g{w) — ^(i_^\ {w — w°'). In the limit a — > 1 this yields Il38tl the familiar (asymmetric) 

relative entropy H{p, a) — TTp{\ogp — logcr) and 57 p^ given by ( l54l i. Like the family of 
divergences introduced here, the minimal WYD divergence occurs for a = 1/2, it is convex 
in a, symmetric around a = 1/2 and yields the maximal ^^ when a = —1 or 2. However, 



a = 1/2 gives k^,^ {w) — 4(1 + ^/w) which is quite different from k"\°^''^{w) = v^. 
The WYD family is often studied only for a £ (0, 1); it was first observed by Hasegawa in 
II39II that it yields a monotone metric if and only if a G [—1,2]. 

The metrics associated with fc™°'^"(i(;) and k'^^^{w) both give increasing families for 
a > i and both yield the maximal metric k{w) = (1 + w){2w) for a the maximal values 
of 1 and 2 respectively. However, neither reduces to the minimal metric k{w) = 2/(1 + w). 



The measure S{s — a) in ( fTSI ) leads to the family ka{i 



_ {l+ay 



(l+w) 



for a e [0, 1] 



2 {l+wa){w+a) 

which reduces to the the maximal and minimal functions for a — 0,1. However, this family 
is neither increasing nor decreasing. Hansen ll4lll has found families of functions which 
increase monotonically from the smallest to the largest of which we mention only 



kaiw) 



W 



1 



for a e [0, 1] 



(A2) 



Appendix B: Proof of a key inequality 

The proof of the contractivity of a general Riemannian metric is based on the following 
theorem first proved in I1I6] . 

Theorem 21 For a channel T : Add ^' Md, we have that. 



tr 



At- 



1 



Ra + SL, 



-A 



tr 



TM^- 



Ra 



sLo 



-A 



> 



(Bl) 



tr 



r(A)i 



R' 



T(a} 



+ SL' 



'T(p) 



-T{A) 



Proof: Let a > 0, then tr[AVA] > 0, and tr[A^Acr] > so that L^y as well as R^ 
are both positive semi definite super operators on the matrix space. Therefore we infer, that 



21 



for a positive p > the operator Ra- + sLp is also positive. We define a matrix X = 

[Ra + sLp]-^^^{A) + [Ra + sLpY/^T*{A) and furthermore B = [RT(a}+sLT(p)]~^T{A). 
Since tr[Xl^X] > 0, we have that 



tr 



At 1 



i?a + sLp 



-tr [T*(Bt)A]-tr [AtT*(B)]+tr [T*(St)[i?^ + sLp]r*(S)] > 0. 

(B2) 



Furthermore note, that 



tr [A'fT*{B)] -tr [T*{B'f)A] = -2tr 



^(^^)l5 ^7 ^(^) 



(B3) 



It therefore suffices to show that we are able to bound the last term in ( IB2l i by the right side 
of the inequality JBll l. Note, that 

iT[T*{B^)[R„ + sLp\T*{B)\ ^tr[T*{B^)T*{B)a + sT*{B^)pT*{B)] (B4) 

< tr [T*{B^B)a + sT*{BB^)p] , 

sincepjCr > and due to the operator inequality T*(i3t)T*(i3) < T*{B^B). This inequality 
holds for any B since T is a channel and by that trace preserving, hence T*(l) = 1. With 

tr [T*{B^B)(j\ = tr [^^^^(cr)] we can write 



tr [T*{B^)[R„ + sLp]T*{B)\ < tr [B^^BTia) + sB'^BT{p)\ (B5) 

= tr [B^[Rt(^^) + sLt(p)]S] = tr [B^T{A)\ = tr 



r(ylt) ^- T{A) 



RT(a) + sLt(p) 
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